APPENDIX 



(1) GENERAL INFORMATION: SE0U ^CE LISTING 

(i) APPLICANT: Seed, Brian et al. 

(ii) TITLE OF INVENTION • P-qpT P-pttxt 

AND METHODS SELECTIN LIGANDS AND RELATED MOLECULES 

(iii) NUMBER OF SEQUENCES: 16 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Clark & Elbing LLP 

(B) STREET: 585 Commercial Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02109-1024 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release #l.o. Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/756 018 

(B) FILING DATE: 25-NOV-96 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/661 960 

(B) FILING DATE: 12-JUN-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/000 213 

(B) FILING DATE: 14-JUN-1995 

(viii) ATTORNEY/ AGENT INFORMATION- 

(A) NAME: Elbing, Karen Lech 

(B) REGISTRATION NUMBER: 35,23 8 

(C) REFERENCE/DOCKET NUMBER: 00786/284002 

(ix) TELECOMMUNICATION INFORMATION- 

(A) TELEPHONE: 617/723-6777 

(B) TELEFAX: 617/723-8962 

(C) TELEX: 

2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



Ala Thr Glu Ala Gin Thr Thr Pro Pro Ala 

10 



1 5 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 : 

Met Ala Thr Asn Ser Leu Glu Thr Ser Thr Gly Thr Ser Gly Pro Pro 
Val Thr 



5 10 15 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Gin Leu Trp Asp Thr Trp Ala Asp Glu Ala Glu Lys Ala Leu Gly Pro 



10 



15 

Leu Leu Ala Arg Asp Arg Arg Gin Ala Thr Glu Tyr Glu Tyr Leu Asp 
20 2 5 30 

Tyr Asp Phe Leu Pro Glu Thr Glu Pro Pro 
35 40 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 



(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Arg Asp Arg Arg Gln Ala Thr Glu ^ ^ ^ A ^ p ^ 



10 15 



Leu Pro Glu Thr 
20 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 5 : 

Arg Asp Arg Arg Gin Ala Thr Glu Phe Glu Phe Leu Asp Phe Asp Phe 



io 15 



Leu Pro Glu Thr 
20 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Arg Asp Arg Arg Gin Ala Ala Glu Tyr Glu Tvr * m 

1 q iyr tjlu T Y r Le u Asp Tyr Asp Phe 

3 in 



io 15 



Leu Pro Glu Ala 
20 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp Arg Arg Gln Ala Ala ^ ^ A ^ ^ a ^ ^ 



10 15 



Leu Pro Glu Ala 
20 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 2287 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
A^ACCA CCATGGACTG GACCXGGAGG TTCCTCTTCT TTGTGGTGGC AGCAGCTACA 
GGTGTCCAGT CCCAGGTGCA GCTGG TC CAG TCTGGGGCTG AGGTGAAGAA GCC^GGTCC 
TCQGTGAAGG TCTCCTGCAA GGCTTCTGGA GGCACCTTCA GCAGCTATGC TATCAGCTGG 
GTGCGACAGG CCCCTGGACA AGGGCTTGAG TGGATGGGAG GGATCATCCC TATCTTTGGT 
ACAGCAAACT ACGCACAGAA GTTCCAGGGC AGAGTCACGA TTACCGCGGA CGAATCCACG 
AGCACAGCCT ACATCGAGCT GAGCAGCCTG AGATCTGAGG ACACGGCCGT GTAITACTGT 
GCGAGAGATA ATGGAGCGTA TTGTAGTGGT GGTAGCTGCT ACTCGGGCTG GTTCGACCCC 
TGGGGCCAGG GAACCCTGGT CACCGTCTCT TCAGGTGAGT ACTGAATTCT AGCTTTCTGG 
GGCAGGCCAG GCCTGACCTT GGCTTTGGGG CAGGGAGGGG GCTAAGGTGA GGCAGGTGGC 
GCCAGCAGGT GCACACCCAA TGCCCATGAG CCCAGACACT GGACGCTCAA CCTCGCGGAC 
AGTTAAGAAC CCAGGGGCCT CTGCGCC^G GCCCAGCTCT GTGCGACAGG GCGGTCACAT 
SGCACCACCT CTCTTGCAGC CTCCACCAAG GGCCCATCGG TCTTCCCCCT GGCACCCTCC 
TCCAAGAGCA CCTCTGGGGG CACAGCGGCC CTGGGCTGCC TGGTCAAGGA CTACTTCCCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



GAACCGGTGA CGGTGTCGTG GAACTCAGGC GCCCTGACCA GCGGCGTGCA CACCTTCCCG 
GCTGTCCTAC AGTCCTCAGG ACTCTACTCC CTCAGCAGCG TGGTGACCGT GCCCTCCAGC 
AGCTTGGGCA CCCAGACCTA CATCTGCAAC GTGAATCACA AGCCCAGCAA CACCAAGGTG 
GACAAGAAAG TTGGTGAGAG GCCAGCACAG GGAGGGAGGG TGTCTGCTGG AAGCAGGCTC 
AGCGCTCCTG CCTGGACGCA TCCCGGCTAT GCAGCCCCAG TCCAGGGCAG CAAGGCAGGC 
CCCGTCTGCC TCTTCACCCG GAGCCTCTGC CCGCCCCACT CATGCTCAGG GAGAGGGTCT 
TCTGGCTTTT TCCCAGGCTC TGGGCAGGCA CAGGCTAGGT GCCCCTAACC CAGGCCCTGC 
ACACAAAGGG GCAGGTGCTG GGCTCAGACC TGCCAAGAGC CATATCCGGG AGGACCCTGC 
CCCTGACCTA AGCCCACCCC AAAGGCCAAA CTCTCCACTC CCTCAGCTCG GACACCTTCT 
CTCCTCCCAG ATTCCAGTAA CTCCCAATCT TCTCTCTGCA GAGCCCAAAT CTTGTGACAA 
AACTCACACA TGCCCACCGT GCCCAGGTAA GCCAGCCCAG GCCTCGCCCT CCAGCTCAAG 
GCGGGACAGG TGCCCTAGAG TAGCCTGCAT CCAGGGACAG GCCCCAGCCG GGTGCTGACA 
CGTCCACCTC CATCTCTTCC TCAGCACCTG AACTCCTGGG GGGACCGTCA GTCTTCCTCT 
TCCCCCCAAA ACCCAAGGAC ACCCTCATGA TCTCCCGGAC CCCTGAGGTC ACATGCGTGG 
TGGTGGACGT GAGCCACGAA GACCCTGAGG TCAAGTTCAA CTGGTACGTG GACGGCGTGG 
AGGTGCATAA TGCCAAGACA AAGCCGCGGG AGGAGCAGTA CAACAGCACG TACCGGGTGG 
TCAGCGTCCT CACCGTCCTG CACCAGGACT GGCTGAATGG CAAGGAGTAC AAGTGCAAGG 
TCTCCAACAA AGCCCTCCCA GCCCCCATCG AGAAAACCAT CTCCAAAGCC AAAGGTGGGA 
CCCGTGGGGT GCGAGGGCCA CATGGACAGA GGCCGGCTCG GCCCACCCTC TGCCCTGAGA 
GTGACCGCTG TACCAACCTC TGTCCTACAG GGCAGCCCCG AGAACCACAG GTGTACACCC 
TGCCCCCATC CCGGGATGAG CTGACCAAGA ACCAGGTCAG CCTGACCTGC CTGGTCAAAG 
GCTTCTATCC CAGCGACATC GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT 
ACAAGACCAC GCCTCCCGTG CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA 
CCGTGGACAA GAGCAGGTGG CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG 
CTCTGCACAA CCACTACACG CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGAGTGCGAC 
GGCCGGC 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2287 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Lys Leu Thr Thr Met Asp Trp Thr Trp Arg Phe Leu Phe Phe Val Val 
15 10 15 

Ala Ala Ala Thr Gly Val Gin Ser Gin Val Gin Leu Val Gin Ser Gly 
20 25 30 

Ala Glu Val Lys Lys Pro Gly Ser Ser Val Lys Val Ser Cys Lys Ala 
35 40 45 

Ser Gly Gly Thr Phe Ser Ser Tyr Ala lie Ser Trp Val Arg Gin Ala 
50 55 60 

Pro Gly Gin Gly Leu Glu Trp Met Gly Gly He He Pro He Phe Gly 
65 70 75 80 

Thr Ala Asn Tyr Ala Gin Lys Phe Gin Gly Arg Val Thr He Thr Ala 
85 90 95 

Asp Glu Ser Thr Ala Arg Asp Asn Gly Ala Tyr Cys Ser Gly Gly Ser 
100 105 HO 

Cys Tyr Ser Gly Trp Phe Asp Pro Trp Gly Gin Gly Thr Leu Val Thr 
115 120 125 

Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro 
130 135 140 

Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val 
145 150 155 160 

Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala 
165 170 175 

Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly 
180 185 190 

Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Asp Lys 
195 200 205 



Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys 
210 215 220 



Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro 
225 230 235 240 

Lys Pro Lys Asp Thr Leu Met lie Ser Arg Thr Pro Glu Val Thr Cys 
245 250 255 

Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp 
260 265 270 

Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu 
275 280 285 

Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu 
290 295 300 

His Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn 
305 310 315 320 

Lys Ala Leu Pro Ala Pro lie Glu Lys Thr lie Ser Lys Ala Lys Gly 
325 330 335 

Gin Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu 
340 345 350 

Leu Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr 
355 360 365 

Pro Ser Asp lie Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn 
370 375 380 

Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe 
385 390 395 400 

Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn 
405 410 415 

Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr 
420 425 430 

Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 
435 440 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



ATGGCGCTGT CCTGGGTTCT TACAGTCCTG AGCCTCCTAC CTCTGCTGGA AGCCCAGATC 60 

CCATTGTGTG CCAACCTAGT ACCGGTGCCC ATCACCAACG CCACCCTGGA CCAGATCACT 12 0 

GGCAAGTGGT TTTATATCGC ATCGGCCTTT CGAAACGAGG AGTACAATAA GTCGGTTCAG 18 0 

GAGATC CAAG CAACCTTCTT TTACTTCACC CCCAACAAGA CAGAGGACAC GATCTTTCTC 24 0 

AGAGAGTACC AGACCCGACA GGACCAGTGC ATC TATAACA CCACCTACCT GAATGTCCAG 3 00 

CGGGAAAATG GGACCATCTC CAGATACGTG GGAGGCCAAG AGCATTTCGC TCACTTGCTG 3 60 

ATCCTCAGGG AC AC C AAGAC CTACATGCTT GCTTTTGACG TGAACGATGA GAAGAAC TGG 42 0 

GGGCTGTCTG TCTATGCTGA CAAGCCAGAG ACGACCAAGG AGCAACTGGG AGAGTTCTAC 4 80 

GAAGCTCTCG ACTGCTTGCG CATTCCCAAG TCAGATGTCG TGTACACCGA TTGGAAAAAG 540 

GATAAGTGTG AGCCACTGGA GAAGCAGCAC GAGAAGGAGA GGAAACAGGA GGAGGGGGAA 600 

TCGGATCCCG AGGGTGAGTA CTAAGCTTCA GCGCTCCTGC CTGGACGCAT CCCGGCTATG 660 

CAGCCCCAGT CCAGGGCAGC AAGGCAGGCC CCGTCTGCCT CTTCACCCGG AGCCTCTGCC 72 0 

CGCCCCACTC ATGCTCAGGG AGAGGGTCTT CTGGCTTTTT CCCAGGCTCT GGGCAGGCAC 7 80 

AGGCTAGGTG CCCCTAACCC AGGCCCTGCA CACAAAGGGG CAGGTGCTGG GCTCAGACCT 840 

GCCAAGAGCC ATATCCGGGA GGACCCTGCC CCTGACCTAA GCCCACCCCA AAGGCCAAAC 900 

TCTCCACTCC CTCAGCTCGG ACACCTTCTC TCCTCCCAGA TTCCAGTAAC TCCCAATCTT 960 

CTCTCTGCAG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCACCGTG CCCAGGTAAG 102 0 

CCAGCCCAGG CCTCGCCCTC CAGCTCAAGG CGGGACAGGT GCCCTAGAGT AGCCTGCATC 10 8 0 

CAGGGACAGG CCCCAGCCGG GTGCTGACAC GTCCACCTCC ATCTCTTCCT CAGCACCTGA 114 0 

ACTCCTGGGG GGACCGTCAG TCTTCCTCTT CCCCCCAAAA CCCAAGGACA CCCTCATGAT 12 00 

CTCCCGGACC CCTGAGGTCA CATGCGTGGT GGTGGACGTG AGCCACGAAG ACCCTGAGGT 12 60 

CAAGTTCAAC TGGTACGTGG ACGGCGTGGA GGTGCATAAT GCCAAGACAA AGCCGCGGGA 13 2 0 

GGAGCAGTAC AACAGCACGT ACCGGGTGGT CAGCGTCCTC ACCGTCCTGC ACCAGGACTG 13 80 

GCTGAATGGC AAGGAGTACA AGTGCAAGGT CTCCAACAAA GCCCTCCCAG CCCCCATCGA 144 0 

GAAAAC CATC TCCAAAGCCA AAGGTGGGAC CCGTGGGGTG CGAGGGCCAC ATGGACAGAG 150 0 

GCCGGCTCGG CCCACCCTCT GCCCTGAGAG TGACCGCTGT ACCAACCTCT GTCCTACAGG 1560 

GCAGCCCCGA GAACCACAGG TGTACACCCT GCCCCCATCC CGGGATGAGC TGACCAAGAA 162 0 



CCAGGTCAGC CTGACCTGCC TGGTCAAAGG CTTCTATCCC AGCGACATCG CCGTGGAGTG 1680 

GGAGAGCAAT GGGCAGCCGG AGAACAACTA CAAGACCACG CCTCCCGTGC TGGACTCCGA 1740 

CGGCTCCTTC TTCCTCTACA GCAAGCTCAC CGTGGACAAG AGCAGGTGGC AGCAGGGGAA 1800 

CGTCTTCTCA TGCTCCGTGA TGCATGAGGC TCTGCACAAC CACTACACGC AGAAGAGCCT 1860 



CTCCCTGTCT CCGGGTAAAT GAGTGCGACG GCCG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Leu Ser Trp Val Leu Thr Val Leu Ser Leu Leu Pro Leu Leu 
1 5 10 15 

Glu Ala Gin He Pro Leu Cys Ala Asn Leu Val Pro Val Pro He Thr 
20 25 30 

Asn Ala Thr Leu Asp Gin He Thr Gly Lys Trp Phe Tyr He Ala Ser 
35 40 45 

Ala Phe Arg Asn Glu Glu Tyr Asn Lys Ser Val Gin Glu He Gin Ala 
50 55 60 



1894 



Thr Phe Phe Tyr Phe Thr Pro Asn Lys Thr Glu Asp Thr lie Phe Leu 
65 70 75 80 

Arg Glu Tyr Gin Thr Arg Gin Asp Gin Cys He Tyr Asn Thr Thr Tyr 
85 90 95 

Leu Asn Val Gin Arg Glu Asn Gly Thr He Ser Arg Tyr Val Gly Gly 
100 105 HO 

Gin Glu His Phe Ala His Leu Leu He Leu Arg Asp Thr Lys Thr Tyr 
115 120 125 

Met Leu Ala Phe Asp Val Asn Asp Glu Lys Asn Trp Gly Leu Ser Val 
130 135 140 

Tyr Ala Asp Lys Pro Glu Thr Thr Lys Glu Gin Leu Gly Glu Phe Tyr 
145 150 155 160 

Glu Ala Leu Asp Cys Leu Arg He Pro Lys Ser Asp Val Val Tyr Thr 
165 170 I 75 

Asp Trp Lys Lys Asp Lys Cys Glu Pro Leu Glu Lys Gin His Glu Lys 
180 185 190 

Glu Arg Lys Gin Glu Glu Gly Glu Ser Asp Pro Glu Gly Glu Pro Lys 
195 200 205 

Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu 
210 215 220 

Leu Glv Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr 

„ n 235 240 

225 230 til 

Leu Met He Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val 

245 250 255 

Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val 
260 265 270 

Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser 
275 280 285 

Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp Trp Leu 
290 295 300 

Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala 
305 310 315 320 

Pro He Glu Lys Thr He Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro 
325 330 335 

Gin Val Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gin 
340 345 350 



Val Ser Leu Thr 
355 

Val Glu Trp Glu 
370 

Pro Pro Val Leu 
385 

Thr Val Asp Lys 



Val Met His Glu 
420 



Cys Leu Val Lys 
360 

Ser Asn Gly Gin 
375 

Asp Ser Asp Gly 
390 

Ser Arg Trp Gin 
405 

Ala Leu His Asn 



Gly Phe Tyr Pro 



Pro Glu Asn Asn 
380 

Ser Phe Phe Leu 
395 

Gin Gly Asn Val 
410 

His Tyr Thr Gin 
425 



Ser Asp lie Ala 
365 

Tyr Lys Thr Thr 



Tyr Ser Lys Leu 
400 

Phe Ser Cys Ser 
415 

Lys Ser Leu Ser 
430 



Leu Ser Pro Gly Lys 
435 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 



Lys Leu Thr Thr Met Asp Trp Thr Trp Arg Phe Leu Phe Phe Val Val 
* 10 15 



Ala 



Ala Ala Thr Gly Val Gin Ser Gin Val Gin Leu Val Gin Ser Gly 
20 25 30 



Ala Glu Val Lys Lys Pro Gly Ser Ser Val Lys Val Ser Cys Lys Ala 
35 40 45 

Ser Gly Gly Thr Phe Ser Ser Tyr Ala lie Ser Trp Val Arg Gin Ala 



50 



55 60 



Pro Gly Gin Gly Leu Glu Trp Met Gly Gly He lie Pro He Phe Gly 
65 70 75 80 

Thr Ala Asn Tyr Ala Gin Lys Phe Gin Gly Arg Val Thr lie Thr Ala 
85 90 95 

Asp Glu Ser Thr Ala Arg Asp Asn Gly Ala Tyr Cys Ser Gly Gly Ser 
100 105 HO 



Cys Tyr Ser Gly Trp 
115 

Val Ser Ser Ala Ser 
130 

Ser Ser Lys Ser Thr 
145 

Lys Asp Tyr Phe Pro 
165 

Leu Thr Ser Gly Val 
180 

Leu Tyr Ser Leu Ser 
195 

Lys Val Glu Pro Lys 
210 

Pro Ala Pro Glu Leu 
225 



Phe Asp Pro Trp Gly Gin 
120 

Thr Lys Gly Pro Ser Val 
135 

Ser Gly Gly Thr Ala Ala 
150 155 

Glu Pro Val Thr Val Ser 
170 

His Thr Phe Pro Ala Val 
185 

Ser Val Val Thr Val Pro 
200 

Ser Cys Asp Lys Thr His 
215 

Leu Gly Gly Pro Ser Val 
230 235 




Gly Thr Leu Val Thr 
125 

Phe Pro Leu Ala Pro 
140 

Leu Gly Cys Leu Val 
160 

Trp Asn Ser Gly Ala 
175 

Leu Gin Ser Ser Gly 
190 

Ser Ser Ser Asp Lys 
205 

Thr Cys Pro Pro Cys 
220 

Phe Leu Phe Pro Pro 
240 



Lys Pro Lys Asp 



Val Val Val Asp 
260 

Tyr Val Asp Gly 
275 

Glu Asn Tyr Ser 
290 

His Gin Asp Trp 
305 

Lys Ala Leu Pro 



Gin Pro Arg Glu 
340 

Leu Thr Lys Asn 
355 

Pro Ser Asp He 
370 



Thr Leu Met He 
245 

Val Ser His Glu 



Val Glu Val His 
280 

Ser Thr Tyr Arg 
295 

Leu Asn Gly Lys 
310 

Ala Pro He Glu 
325 

Pro Gin Val Tyr 



Gin Val Ser Leu 
360 

Ala Val Glu Trp 
375 



Ser Arg Thr Pro 
250 

Asp Pro Glu Val 
265 

Asn Asn Lys Thr 



Val Val Ser Val 
300 

Glu Tyr Lys Cys 
315 

Lys Asn He Ser 
330 

Thr Leu Pro Pro 
345 

Thr Cys Leu Val 



Glu Ser Asn Gly 
380 



Glu Val Thr Cys 
255 

Asn Phe Ser Trp 
270 

Lys Pro Arg Glu 
285 

Leu Thr Val Leu 



Asn Val Ser Asn 
320 

Lys Ala Lys Gly 
335 

Ser Arg Asp Glu 
350 

Lys Gly Phe Tyr 
365 

Gin Pro Glu Asn 



Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe 

385 390 395 400 

Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn 

405 410 415 



Val Phe Ser Cys Ser Val Met His 
420 

Gin Lys Ser Leu Ser Leu Ser Pro 
435 440 



Glu Ala Leu His Asn His Tyr Thr 
425 430 

Gly Lys 



INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Pro Glu Met Leu Arg Asn Ser Thr Asp Thr Thr Pro Leu Thr Gly 
15 10 15 

Pro Gly Thr Pro Glu Ser Thr Thr Val Glu Pro Ala Ala Arg Arg Ser 
20 25 30 

Thr Gly Leu Asp Ala Gly Gly Ala Val Thr Glu 
35 40 

INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 



Leu Thr Thr Glu Leu Ala Asn Met Gly Asn Leu Ser Thr Asp Ser Ala 
15 10 15 



• 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acids 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Thr Gly Asp Tyr Tyr Glu Asp Ser Tyr Glu Asp lie Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acids 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Glu Asp Tyr Glu Tyr Asp Glu Leu Pro 
1 5 



